CDS

Accession Number TCMCG039C01822
gbkey CDS
Protein Id XP_024026772.1
Location join(140473..140844,141297..141523,141623..141690,142254..142325,142557..142909,143096..143164,143253..143341,143968..146160,146258..146335,146458..146569,146733..146783,146904..146954)
Gene LOC21391495
GeneID 21391495
Organism Morus notabilis

Protein

Length 1244aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA263939
db_source XM_024171004.1
Definition dentin sialophosphoprotein [Morus notabilis]

EGGNOG-MAPPER Annotation

COG_category S
Description Occludin homology domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11807        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTACGGAGGCTCCTCCAAGCTCGGCCGTGGCGGCGGCGGCGGCGCCGGCCGTGGAGCCGGAGCCAAGCGCCTTAGCTCGTCCTTTCCTATGATGCCACCCCACCGTCCCTCTGCCCCCGGCGGCGCCAGCCGCCTCCCCCTTGGCGGCTCCGGCTCATCCGCCAATCCTCGAAGCCGCGTGTCGGGACTGAAGGCTCCGGCGGCGGCGCCGGGCACGGAAGAGACGTTCAGCTTGGTGTCAGGCAATAATCCGCTAGCCTTTGCAATGATCATCAGGCTGGCGCCGGATTTGGTCGACGAAATCAGACGCCTCGAAGCCCAGGGTCGGACCACCCGTATCAAATTCGATTCCAATAATTCCAATGGAAATGTTATTGATGTTGGTGGCAAGGAGTTCAGATTCAGATGGTCGCAAGAGAATGGTGATCTATCAGATATATACGAAGAAGGTCAAAGCGGTGAAGATGGTAATGGTTTGCTTGTCGAATCCGGTAGTGCATGGCGCAAGCTAAATATGCAGCGTATCTTGGATGAGTCTACCACAAGCCGTGTTAAAAAGCTTTCAGAGGAAGCCGAGCGCAAGAAAGAATTGCGCCAAGCCATTGTGTTAGAGCCCGGGAATCCATCTATGAAGAGTCAAATAAAGCAGTTAGCTGCTGTTGAGACTAATCCATGGAAGCATTTCAAACAGAAGAAAGAGCCACCACCTAAGAAAAGGAAAATCGAACCACCTCAAGTAGGAGGTCACCCTAAATCTGCATATAAATCTGGAATATCATCAACAACTACTGTCAAGAGCAGACACGCATCATCTCCAGTTCCATCTCCACCTGAGCAATCCAGACCTTCAACATCTCCATTAAGAAATGTAAATATTTCCAAGAGTCATGGAAGTGTAGACGATGTCATAAATCAAGTGATCGGTAAAGACAAAGCTGCTGCTAGCTCCGACAAAGAAATCCCAGCCAAGGCTACTACTCTAGTACGTGAAGCAACAGGACGTAAGAGCAATATTGGAGCTAAACCAACGGATTTGCAGAGTATCTTGATTAATCTGCTTAGGGAAAATCCGAAGGGGATGAGCATGAAGACTTTGGAAAAAGCCGTTGGAGATTCAATTCCCAACTCTGGAAAGCAAATTGAGGCCATAATGAAGAAAATTGCAACTTTCCAAGCGCCAGGAAAATATCTCTTGAAACCTGGACTGGACACAGAAAGCTTCACGAAACGCTCATCTGAAAGCGAAAGTTCTCCAGAAGAGAATCTTCATCAGACAGCCGCTGCAGAAGACAATCGTGATCAGATAAACGCTCCGGAACTGGATCTTAAAGAGAGAGCCCCTTCTAATGAATTTGAGGAACAGGGACAGTTGAACTCTAACCATGGAGAAGAGCCCAGTGCATTGGAAAAAATTGATATCCAACAACATTCACCTGATCTGTTTGGTGAAAAAAAGGTCTCTGACAATAGTGAAGGGCATGCTGGGAGTTCTAGCGATAGCGGAAGTGACAGTGATAGTGAAAGTGACAGCAGCGATAGTGATAGTGGTAGTGGAAGCCCTAGTAGAAGCAGAAGTAGAAGTAGAAGCCCGGTGAGTGGGAGTAGTAGTGATAGTGACAGTGATGCATCTTCCAACAGCAAGGAGGGTTCAGATGTGGATGTGGATATTATGACAAGTGATGATGACAAGGAACCCAAGCATAAGTTGCAATCTAAACCAGCATTCTCAAGATCACCTGTTCAATGGGGAAGTCCTGATGGCAAGCCTGTGCAGGGTGGCAATGAGGAGAAGCAGGATGATCATGAATTGGATCTTGTTGAGATTGAGAATGATGTTCCTGATGGTAATAAACGGGAAACTGAAATTGGTACTTCTGCTCACAAAGATTTTGAGAAACCTATGGAAAAGACCAGTACTTTTTCACCTGATCGAGATAAGCTCCAAGAGCGCCAAAATTTTATAGGAAGTTTGTTTGATGAAGGGGACGATGCTGTTAGAGTTGGCTCCAGGTATGAACACTCTAATAGCTCTGAGAAGATATCTAAAGGAAAATACAAAAGGGGCATGGAGGTAAAACGTTCAGATGATAAATCTGAGTATGCAAAAAGGTCCAAAACAGATGCCTCAAGCCAGGCTCCTGTTTCTGGAGGCAGGGATGTCCAGTTCCAGGAAAGTTCTCATAGATTGCCTTCTAACAGACTCATTGAAGATCCCAATAGGGACCCCATCATTCAAGCTATAAATGGAGTTGATAGGGATGGTGACGGTGAGCTCACCATACAGAAAGGAAATAACCAATTGTTTTCTGGAAAATCTAGTGCAGATGTTCAACACTCAGGTACAAGGTTATTTGAACAAAGTGCTCCTTCGAAGATTCCTGATACATCAGAGAGAATGCATAGTTATGCTGAAAGCTTGGGACATGGCCGTAAATATTCTGAAAAGAGCTCTCATGTGCATGAAGGTTTACCTTCGCAAAAGGCTAAATTTCTTACAGATGCCAAATATGAGGGTGGTTATGCTAATGAGAAAAGGGTTCCAAAAAATCCCAGGGAAGGTGGTGTGAGGGGCAAACAGTCAGTTCCCTTTGATTCACACTACAAGAAACATGGTGAAGTAGTTGGAAAGTTCAAGAATGCTGGACAGGTTTCCGGCTCCTTCCTCAGTACTTCACCAAAGGATCACAGTAGAGCTGGTGTAGATAAATCCCCTGCTCTTAATGGGAGAGGCAATAGACTCCAAAGAGAGTATTCAGACTTGGAGTTGGGTGAACTTCGTGAGCCCCTTCCAGAGGAAGCACCTGTTAAAAAGCAATTTGAAAGAAAAAGTTCATTTAAACAATCGGAGAACAAACCGGACAGTTCAGATAACTGGAATTTGGATATGATTAAAGGGAAGCCTGCTGAAAAGGCAACTTTAGATTTGGGAAAGTCGTCTTCACCTGACCCAAACACCAAGGGTCCCAGCAATTTGGAAGGCTCAAATAAAAAGAGGAAACAAGAAGACTGTGTTGAAGATTTAACATGGTCTCAACATAAGGTTATGCAATCTCAATCACATTCGAGACTAGATAATGTTGAGTTTGGGTTTCAGTCCAGCAATTTGGCAGAGACAAATGGTGCTCGTCAAAATGAAGGTGGAGTCAGACTGGGGAGTGCTCCTGAAGGCTATGGAGAAAGCAACAAGAAAGCACCCGCCCCTCAGCTACATGATACCAGACGAGAACCAGTTTCCCACTCCATGAAGACGAAAGAAAGAAAAAGATTCACGACTAGTACAGTGGCGGAGTTACCTGATGGCCGCACTGAGTCACTTTTGGCAGAAGGCAACAACAGTGAGCGAAAGAGAAGGGATTCCTCCTCTGACGAAAATAGTTGCTCCTATTCTAAGTATGAAAAGGATGAGCCAGATATCAAGGGCCCAATAAAGGATCTTTCTCAGTACAAAGAATACGTGCAGGAGTATCACGATAAGTATGATTCATACTGCTCCCTCAACAAGATCCTAGAGAGTTACAGGAAGGAGTTTCAGAAACTGGGAAAGGACCTTGAGCATGCTAAAGGCAGAGATATGGAGAGATATAATAACGTCTTGGAGCAGCTGAAGGAATCCTATCGTCAATGTGGAACGAGACACAAGAGGCTGAAGAAAATATTTGTGGTGCTTCATGAAGAATTGAAGCACCTTAAGCAAAGGATTAAAGATTTCGCACTTTCTTATTCAAGAGATTGA
Protein:  
MYGGSSKLGRGGGGGAGRGAGAKRLSSSFPMMPPHRPSAPGGASRLPLGGSGSSANPRSRVSGLKAPAAAPGTEETFSLVSGNNPLAFAMIIRLAPDLVDEIRRLEAQGRTTRIKFDSNNSNGNVIDVGGKEFRFRWSQENGDLSDIYEEGQSGEDGNGLLVESGSAWRKLNMQRILDESTTSRVKKLSEEAERKKELRQAIVLEPGNPSMKSQIKQLAAVETNPWKHFKQKKEPPPKKRKIEPPQVGGHPKSAYKSGISSTTTVKSRHASSPVPSPPEQSRPSTSPLRNVNISKSHGSVDDVINQVIGKDKAAASSDKEIPAKATTLVREATGRKSNIGAKPTDLQSILINLLRENPKGMSMKTLEKAVGDSIPNSGKQIEAIMKKIATFQAPGKYLLKPGLDTESFTKRSSESESSPEENLHQTAAAEDNRDQINAPELDLKERAPSNEFEEQGQLNSNHGEEPSALEKIDIQQHSPDLFGEKKVSDNSEGHAGSSSDSGSDSDSESDSSDSDSGSGSPSRSRSRSRSPVSGSSSDSDSDASSNSKEGSDVDVDIMTSDDDKEPKHKLQSKPAFSRSPVQWGSPDGKPVQGGNEEKQDDHELDLVEIENDVPDGNKRETEIGTSAHKDFEKPMEKTSTFSPDRDKLQERQNFIGSLFDEGDDAVRVGSRYEHSNSSEKISKGKYKRGMEVKRSDDKSEYAKRSKTDASSQAPVSGGRDVQFQESSHRLPSNRLIEDPNRDPIIQAINGVDRDGDGELTIQKGNNQLFSGKSSADVQHSGTRLFEQSAPSKIPDTSERMHSYAESLGHGRKYSEKSSHVHEGLPSQKAKFLTDAKYEGGYANEKRVPKNPREGGVRGKQSVPFDSHYKKHGEVVGKFKNAGQVSGSFLSTSPKDHSRAGVDKSPALNGRGNRLQREYSDLELGELREPLPEEAPVKKQFERKSSFKQSENKPDSSDNWNLDMIKGKPAEKATLDLGKSSSPDPNTKGPSNLEGSNKKRKQEDCVEDLTWSQHKVMQSQSHSRLDNVEFGFQSSNLAETNGARQNEGGVRLGSAPEGYGESNKKAPAPQLHDTRREPVSHSMKTKERKRFTTSTVAELPDGRTESLLAEGNNSERKRRDSSSDENSCSYSKYEKDEPDIKGPIKDLSQYKEYVQEYHDKYDSYCSLNKILESYRKEFQKLGKDLEHAKGRDMERYNNVLEQLKESYRQCGTRHKRLKKIFVVLHEELKHLKQRIKDFALSYSRD